Network  Support  for  Group  Coordination 

Hans-Peter  Dommel 

Computer  Engineering  Department,  Santa  Clara  University 
Santa  Clara,  CA  95053,  USA 
and 

J.  J.  Garcia-Luna-Aceves 
Raskin  School  of  Engineering 

Computer  Engineering  Department,  University  of  California 
"  Santa  Cruz,  CA  95064,  USA 

ABSTRACT 

Recent  advances  in  computer  hardware  and  networking  technology  have  incited  the  deployment  of  wide-area  streaming 
media  services  in  the  Internet.  While  such  efforts  as  video-on-demand  are  largely  limited  to  unidirectional  delivery  of  content 
to  the  desktop,  synchronously  interactive  group-oriented  application  services  are  foreseeable.  In  such  applications,  users 
collaborate  on  a  shared  workspace  and  freely  exchange  information  in  real-time  under  the  premise  of  coordination  and 
conflict  freedom.  Telecollaborative  applications  such  as  telemedicine  or  distance  learning  may  profit  from  such  coordination 
services.  Ultimately,  group  coordination  allows  for  groupware-style  computing  at  Internet  scope.  The  current  IP-multicast 
framework  contains  provisions  for  group  membership  control  and  reliable  dissemination  services,  however,  it  lacks  support 
for  group  coordination.  In  this  paper,  we  present  a  framework  on  network  control  and  coordination  functions  to  orchestrate 
synchronous  multimedia  groupwork.  Our  goal  is  to  achieve  a  better  understanding  of  the  group  coordination  problem  as  an 
important  component  of  future  Internet  multimedia  collaboration  tools. 
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1.  INTRODUCTION 

The  proliferation  of  Internet  services  in  recent  years,  in  particular  the  World  Wide  Web,  indicates  the  high  demand  for  sharing 
of  information  through  computer  networks.  A  wider  distribution  of  the  work  force,  in  form  of  telecommuting  and  ubiquitous 
computing  [40],  the  advent  of  networked  multimedia,  and  less  expensive  technology  have  shifted  telecollaboration  into  the 
spotlight  of  mainstream  computing.  Telecollaboration  comes  in  many  faces,  such  as  email,  instant  messaging,  chat  tools, 
application  sharing,  and  real-time  interaction  on  the  same  media  or  resources,  qualified  by  the  increasing  degree  of  mutual 
awareness  and  the  ability  for  instant  information  exchange  and  manipulation.  Synchronous  telecollaboration  enables  people 
in  different  geographic  locations  to  share  and  jointly  manipulate  multimedia  information  in  real-time  and  at  various  levels  of 
granularity,  bridging  time  and  space.  This  aspect  stands  in  contrast  to  legacy  client-server  applications  such  as  Internet  radio 
broadcast  or  video-on-demand,  and  to  asynchronous,  document-centric  collaboration  tools  like  email,  instant  messaging,  or 
chat  rooms.  Representative  application  areas  are  collaborative  virtual  environments  [6],  distributed  real-time  gaming  environ¬ 
ments  [7],  distributed  interactive  simulations  (DIS)  [14],  collaboratories  [17],  distance  learning  [23], and  telemedicine  [27]. 

Limitations  in  the  availability  and  accessibility  of  resources  in  the  shared  workspace  of  a  telecollaborative  system  create 
contention,  competition,  and  conflict  among  users  and  make  it  necessary  to  deploy  coordination  mechanisms  to  reach  consen¬ 
sus  on  how  to  jointly  and  effectively  use  the  resources.  Conflicts  stalling  the  workflow  may  occur  before  and  during  resource 
allocation  to  users,  as  well  as  during  actual  usage.  Telecollaborative  services  build  on  the  provision  of  group  coordination 
mechanisms.  These  manage  access,  manipulation,  distribution  and  presentation  issues  between  users  and  shared  resources. 
Such  coordination  mechanisms  are  necessary  to  allow  users  to  achieve  individual  goals  in  the  context  of  group-centered  re¬ 
mote  interaction,  when  telepresence  [3]  substitutes  for  physical  presence.  Group  coordination  services  support  distributed 
hosts  in  coordinating  their  joint  activities,  to  prevent  or  resolve  resource  contention,  conflict  and  inconsistencies  in  the  syn¬ 
chronous  sharing  of  resources.  Group  coordination  protocols,  which  embrace  multicasting  and  consider  network  conditions 
in  the  coordination  processes  between  hosts,  complement  efforts  on  group  membership  known  from  distributed  systems  and 
multicasting  as  an  efficient  message  dissemination  mechanism  for  group  communication. 

In  this  paper,  we  focus  on  network  support  for  synchronous  multimedia  groupwork.  We  envision  a  new  generation  of 
collaborative  multimedia  systems  using  group  coordination  middleware  to  facilitate  multipoint,  multiparty,  multichannel,  and 
multimedia  communication  in  small  to  very  large  groups  and  Internet  scope.  In  such  systems,  groups  and  individuals  can 
selectively,  securely,  and  efficiently  cocreate  and  disseminate  information  with  improved  telepresence  and  mutual  awareness. 
The  paper  organization  is  as  follows:  Section  2  reviews  related  work.  Section  3  discusses  relevant  components  for  network 
support  of  coordination  services.  Section  4  outlines  key  architectural  issues  in  the  design  of  group-coordinative  systems.  We 
conclude  the  paper  in  Section  5. 
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2.  BACKGROUND 


Cerf  et  al.  [5]  pointed  out  the  importance  of  transatlantic  collaboration  infrastructures  in  a  memorandum  in  1991.  Our  work  is 
centered  at  the  interface  between  legacy  groupware  [12],  computer-mediated  communication  [25],  and  computer-supported 
cooperative  work  (CSCW)  [31].  In  their  coordination  theory  framework,  Malone  and  Crowston  [21]  define  coordination  as 
the  “act  of  managing  interdependencies  between  activities  performed  to  achieve  a  goal”,  looking  at  components  of  actors 
(people)  and  agents  (computerized  procedures),  identifying  workgoals,  mapping  goals  to  activities,  and  managing  interde¬ 
pendencies  among  actors  and  activities.  They  distinguish  between  generic  interdependencies,  for  instance  sequenced  or 
simultaneous  actions  on  shared  resources,  and  domain-specific  interdependencies,  e.g.,  specific  data  elements  that  must  be 
passed  between  team  members  to  achieve  successful  groupwork.  Schmidt  and  Simone  [34]  present  an  empirical  charac¬ 
terization  of  computational  coordination  mechanisms  useful  as  general  blueprint  for  the  design  of  coordination  protocols, 
proposing  for  instance  the  construction  of  a  mechanism  such  that  “actors  are  able  to  control  its  execution  and  make  local  and 
temporary  modifications  of  its  behavior  to  cope  with  unforeseen  contingencies”. 

Axelrod  [1]  investigated  cooperation  from  a  game-theoretic  perspective,  specifically,  how  the  tradeoff  between  individual 
greed  and  good  affects  coordinative  strategies  in  groups,  assuming  rational  behavior.  This  problem  is  also  known  as  the 
social  dilemma.  Focusing  on  “tit-for-tat”  games,  interactions  are  interpreted  as  pairwise  alternations  of  moves  with  specific 
payoff  values.  The  pay-off  structure  of  the  interaction  determines  the  game  motive.  For  two  participants  A  and  B,  the  payoff 
structure  for  choosing  two  actions  i  and  j  is  P  =  A,j  +  Bjj.  If  P  =  0,  then  the  interaction  is  called  a  zero-sum  game,  and 
interactions  with  P  f  0  are  called  cooperative  or  mixed-motive  games. 

A  related  approach  uses  economic  models  to  tackle  resource  allocation  in  computer  systems  from  a  market-oriented  per¬ 
spective  [15].  A  cost  function  is  assigned  to  cooperative  activities,  individual  negotiations,  deals,  and  strategies.  An  activity 
between  two  subjects  is  pareto-optimal  if  it  is  not  possible  to  improve  the  utility  for  one  subject  without  lowering  the  utility 
of  the  other  subject.  A  strategy  to  determine  the  progress  of  activities  is  said  to  be  in  equilibrium  if  no  party  has  an  incentive 
to  diverge  from  that  strategy  in  order  to  fulfill  individual  and  group  tasks.  Multiple  equilibria  are  possible  and  two  strategies 
S  and  T  are  said  to  be  in  Nash-equilibrium  [24]  if  one  party  cannot  do  better  other  than  using  T,  when  the  other  party  uses  S, 
i.e.,  the  product  of  individual  utility  values  is  maximized.  However,  it  is  not  simple  to  assess  global  utility  values,  and  choos¬ 
ing  one  of  several  possible  equilibrium  points  may  guarantee  relative  fairness  but  restricts  the  space  of  possible  agreement 
states,  under  the  assumption  that  all  subjects  employ  the  same  utility  measure  and  do  not  cheat.  ContractNet  [36]  was 
an  early  market-based  protocol  approach  towards  distributed  task-completion,  employing  a  bidding  scheme  among  managing 
and  contracting  nodes.  Shenker  [35]  argues  that  applying  a  fair-share  service  discipline  at  network  switches  models  uncoop¬ 
erative  flow  control  satisfying  individual  users’  selfishness  more  realistically  than  traditional  disciplines,  which  presuppose 
cooperation  such  as  First-Come-First-Served  among  users. 

3.  GROUP  COORDINATION  FRAMEWORK 

We  define  coordination  as  an  interactive  scheduling  process  between  two  or  more  users  forming  a  group  to  achieve  joint 
work  goals.  Coordination  correlates  with  cooperation,  which  we  understand  as  the  joint  acting  of  individuals  for  a  mutual 
benefit  -  in  our  context  the  mutual  sharing  of  information  for  data  mining  or  other  forms  of  data  exchange.  We  envision 
network-centric  group  coordination  services  to  support  distributed  hosts  in  coordinating  their  joint  activities,  to  prevent  or 
resolve  resource  contention,  conflict  and  inconsistencies  in  the  synchronous  sharing  of  resources.  Key  components  for  such 
services  are  the  management  of  distributed  resource  access  [8],  ordered,  reliable  message  dissemination  [10],  security  [16], 
and  stream  synchronization  [20],  Coordination  and  cooperation  among  users  in  networked  multimedia  systems  support  the 
process  of  multimedia  collaboration  [30],  which  is  the  actual  act  of  users  working  together  online. 

We  present  a  formal  view  on  entities  and  actions  refining  earlier  efforts  [29,  30]  on  the  definition  of  coordination  and 
control  processes  in  collaborative  multimedia  systems.  Our  work  is  process-oriented  and  differs  from  Candan  et  al.  [4]  who 
concentrate  on  algorithms  for  collaborative  composition  and  transmission  of  media  objects  under  given  quality  constraints, 
and  their  presentation  in  collaborative  sessions. 

We  picture  a  computer  network  as  a  graph  with  nodes  (stations,  hosts)  V  sending  messages  across  links  (channels) 
E  C  V  x  V.  A  connection  is  a  unidirectional  or  bidirectional  transmission  link  from  a  sender  node  to  a  set  of  receiver  nodes. 

DEFINITION  1 .  A  collaboration  environment  F  in  a  computer  network  is  a  tuple 

r  =<s,u,n,E>  (i) 

where  S  =  (V,  E)  is  a  set  of  sessions  £,  U  is  a  set  of  users  (hosts,  processes,  agents,  participants ),  1Z  is  a  set  of  shared 
resources  (media),  and  T  is  a  set  of  floors  controlling  the  resources. 

A  session  provides  the  infrastructure  for  coordination,  cooperation  and  collaboration. 


Definition  2.  A  session  E  E  S  is  a  tuple 


S  =<  Sid,Ti,Te,As,L  >  (2) 

where  Sid  is  a  unique  identifier  within  F,  T,  is  the  initiation  or  announcement  time,  Te  is  the  ending  time,  and  As  is  a  list  of 
attributes  characterizing  the  session  at  level  L.  A  conference  is  a  set  of  sessions  E,  €  S,  where  i  >  1. 

Sid  is  a  unique  session  identifier  per  collaborative  environment,  whose  sequence  number  space  is  wrapped  around  in 
correlation  with  the  turnover  rate  and  lifetime  of  sessions  in  F.  The  time  may  reflect  real-time,  logical  time,  or  define  a 
lifetime  interval  A  =  Te  —  1) .  L  denotes  the  session  level  (default  0).  ,4s  =  (M,  O,  C)  describes  purpose  and  orchestration 
of  a  session  in  terms  of  membership  M,  organization  O,  and  control  C.  Sessions  can  be  flat  (L  =  1)  or  maintain  two  or  more 
levels  with  nested  groups  ( L  >  1). 

Szyperski  [39]  characterizes  session  types  in  a  similar,  but  less  refined  way,  according  to  the  model  of  interaction  (con¬ 
trolled,  dynamic,  static)  and  data  flow  (1  —  n,  n  —  1,  m  —  n).  For  instance,  a  lecture  is  a  controlled,  long-term  interaction 
between  one  sender  and  n  receivers.  Telemetry  is  a  typical  n  —  1  session,  and  a  whiteboard  session  is  typically  rn  —  n. 
Our  session  characterization  applies  to  specific  collaborative  applications,  as  well  as  generic  session  types  in  the  spectrum 
of  real-time  collaborative  work,  such  as  lectures,  business  meetings,  labs,  panels,  brainstorm  meetings,  exams,  interviews,  or 
chats. 

Membership  ( M )  reflects  the  composure  of  the  user  group  in  the  session.  Participation  specifies  whether  information 
is  exchanged  unilaterally,  or  bilaterally  relative  to  a  host,  impacting  user  access  rights  and  data-flow.  Interactive  sessions 
may  be  symmetric,  i.e.,  all  users  have  the  same  view  on  shared  resources  (WYSIWIS),  or  asymmetric,  where  users  pertain 
individual  views  on  the  same  shared  data  space  (relaxed  WYSIWIS)  [37].  Size  specifies  a  small  (<  5),  medium  (<  100),  or 
large  (>  100)  number  of  users,  impacting  scalability  of  the  coordination  mechanism.  Accessibility  declares  whether  a  session 
is  open,  allowing  any  user  to  join,  whereas  closed  sessions  allow  participation  by  invitation  only.  Authorization  specifies 
whether  coordination  primitives  may  use  read-only,  read-write,  or  write-only  privileges  for  the  entire  session.  Users  may 
have  individual,  role-based  authorizations,  as  well. 

Organization  ( O )  entails  specifics  on  how  the  session  is  to  be  orchestrated.  Dataflow  describes  how  data  are  multiplexed 
among  users,  with  a  1  —  1, 1  —  n,  or  1  —  m  transmission  model  and  with  unicast,  broadcast,  or  multicast  in  a  session  of  n  users, 
where  m  <  n.  Delivery  can  be  ordered  or  unordered.  Duration  discerns  between  sessions  with  longer  lifetime  (persistent)  vs. 
short-term  sessions,  where  the  precise  timing  modalities  are  case-specific  and  left  open.  The  scope  specifies  the  hop  limit  for 
packets  sent  by  hosts  in  a  particular  session,  similar  to  the  Time-To-Live  semantics  in  IP,  which  allows  constraining  sessions 
to  a  geographic  range  and  retain  privacy  or  limited  dissemination  to  a  specific  group.  Media  composition  defines  whether  the 
session  uses  a  single  medium  such  as  audio-only,  or  mixed  media,  e.g.,  a  video-audio  combination.  Conduction  refers  to  the 
session  agenda  and  moderation  style,  which  can  be  either  tightly  coupled,  i.e.,  all  users  know  about  each  other  and  follow 
some  agenda  in  the  style  of  “Robert’s  Rules  of  Order”  [32],  or  the  exchange  is  loosely-coupled  and  not  prescribed. 

Control  ( C )  depicts  the  status,  locus  of  control,  and  security  measures  activated  for  a  session.  Sessions  with  overlapping 
or  diverging  interests  can  merge  or  split.  Such  reconfiguration  of  sessions  with  regard  to  membership  and  session  events 
linked  to  specific  phases  must  be  possible  without  session  termination  or  restart  of  applications.  The  session  status  marks 
whether  the  session  is  a  partition  from  a  larger  session,  frozen  but  still  deemed  as  active,  merged  or  revived.  Tracking  of 
states  in  coordination  protocols  and  the  outcome  of  coordination  processes  can  be  logged  and  persistent,  or  ephemeral. 

Locus  of  control  specifies,  whether  membership  and  floor  control  are  being  handled  in  one  central  location,  partially 
distributed  among  several  servers,  or  fully  distributed  across  all  hosts.  Partial  or  full  replication  is  possible  for  the  latter 
two  paradigms.  A  central  controller  can  also  rove  among  all  sites  and  achieve  better  fault  tolerance.  Distributed  control 
is  multilateral,  with  varying  degrees  of  “consentience”  and  “equipollence”,  i.e.,  how  much  everybody  participates  and  how 
authorities  and  responsibilities  are  allocated.  Multilateral  control  is  either  successive,  partitioned,  democratic  or  anarchic. 
Successive  controllership  allows  one  distinct  controller  at  a  time,  and  alternates  among  users,  and  partitioned  control  lets 
several  controllers  each  perform  a  subset  of  control  operations.  Democratic  control  lets  all  users  contribute  to  the  control 
process,  e.g.,  via  voting.  Anarchic  control  gives  all  subjects  complete  freedom  of  acting  and  control  of  sharing  is  peer-to-peer 
based. 

The  control  locus  is  related  to  the  supervision  attribute,  indicating  whether  the  communication  process  in  coordination 
is  moderated,  peer-reviewed,  or  free.  A  moderator  decides  which  users  may  send  information,  what  is  forwarded  to  the 
receivers,  or  which  receivers  may  receive  a  particular  content  or  access  a  specific  resource,  implementing  a  notion  of  floor 
control.  McKinlay  et  al.  [22]  note  for  face-to-face  meetings  that  the  importance  of  chaired  guidance  increases  with  the 
session  size,  and  the  difficulty  in  performing  a  joint  task,  since  each  member’s  ability  to  participate  and  influence  others  is 
reduced.  Finally,  coordination  touches  upon  security  issues,  specifying  whether  users  are  anonymous  or  authenticated  in  their 
exchanges,  either  at  session  initiation,  or  at  every  turn,  and  whether  information  is  encrypted.  Rajan  et  al.  [29]  identify  a 
confluence  as  a  special  session  type,  where  all  participants  transmit  and  receive  the  same  set  of  media  streams  mixed  together 


in  broadcast,  which  saves  bandwidth.  The  notion  of  confluences  and  session  nesting  leads  to  the  concept  of  multilevel  or 
hierarchical  sessions,  whose  discussion  we  omit  for  space  reasons. 

Definition  3.  A  user  U  £  U  is  a  tuple 


U  =<  Uid,  Sid,  Loc,Tj,  Ti,  Au  >  (3) 

where  Uid  is  a  unique  identifier  within  the  session  Sid,  Loc  is  the  local  or  remote  location,  given  as  IP-address  or  unique 
host  identifier,  Tj  is  the  joining  time,  Tj  is  the  leaving  time,  and  Ajj  is  a  list  of  user  attributes. 

Users  can  be  represented  by  system  agents  [11,  19].  Accordingly,  users  are  characterized  by  their  roles,  authority,  identity, 
entry  capabilities  and  access  rights,  which  impact  the  applicable  floor  control  strategy.  Users  can  be  co-located  in  the  same 
space,  or  geographically  distributed.  We  distinguish  between  social  and  system  roles.  Social  roles  describe  the  function  of 
a  user  within  a  session,  e.g.,  being  a  panelist  or  lecturer.  System  roles  refer  to  the  control  function  within  a  floor  control 
protocol:  participants  without  a  specific  role  can  be  either  a  receiver  or  inactive.  The  owner  of  a  resource  R  is  the  node  that 
injects  R  into  a  session  and  initiates  floor  control  for  R,  which  may  vanish  from  a  session  if  the  owner  leaves.  The  floor 
coordinator  (FC)  is  an  arbiter  over  a  resource  R ,  or  a  session  moderator  granting  or  denying  a  floor  on  R  during  session 
time  to  the  floor  holder  ( FH ),  who  attains  the  exclusive  right  to  work  on  /?  for  a  floor  holding  period.  FC  and  FH  may 
be  located  at  different  nodes,  or  be  assumed  by  the  same  node.  These  roles  may  be  statically  assigned  at  session  start,  or 
rove  among  users  during  session  conduction.  Users  without  control  roles  are  general  session  members,  and  can  be  active 
or  inactive,  depending  on  whether  they  invoke  state  transitions  in  the  coordination  mechanism.  Role-based  floor  control  in 
dynamic  sessions  contrasts  static  role-based  access  control  (RBAC)  models  [33]. 

In  the  list  of  user  attributes  Ajj,  Authority  defines  whether  the  user  is  a  simple  participant,  privileged  as  system  root  user, 
or  moderator,  linking  this  field  with  the  role  entries.  A  moderator  can  be  permanent  FC.  As  a  social  role,  the  moderator 
equates  to  a  session  supervisor  being  able  to  inspect  all  session  turns  between  users.  Identity  specifies  whether  the  user  wants 
to  remain  anonymous  or  whether  the  Uid  can  be  posted  to  the  session.  An  entry  is  either  independent,  i.e.,  unaware  of  the 
actions  and  entries  of  others,  reflective,  i.e.,  polling  session  members,  consultative  based  on  “contextual  clue  messages”, 
partitioned  and  representing  a  subtask,  based  on  voting  among  the  group,  or  debriefed  and  recorded  [12],  In  addition,  user 
entries  may  be  temporary  or  permanent,  and  logged  for  the  purpose  of  reviewing  histories  of  collaborative  sessions,  or  for 
undoing  certain  steps  [28].  Access  defines  the  basic  privileges  for  working  on  a  resource,  in  receive-only,  send-and-receive, 
and  send-only  mode,  analogous  to  read  and  write  authorizations  in  file  systems.  Aggregation  of  users  leads  to  the  notion  of 
groups: 

DEFINITION  4.  A  user  group  (multicast  group)  G  is  a  set  of  users  U  with  common  session  and  user  attributes,  expressing  a 
common  media  or  task  focus,  such  that  U  D  G  D  U . 

Definition  5.  A  resource  R  c  1Z  is  a  tuple 

R  =<  Rid ,  Sid,  Pid,  Uid,  Tc,  T, j,  Ar  >  (4) 

where  Rid  is  a  unique  resource  identifier  owned  by  user  Uid  within  session  Sid,  Pid  is  the  parent  identifier  or  the  resource 
that  Rid  belongs  to,  Tc  is  the  time  of  creation  or  injection  of  the  resource  into  the  collaborative  workspace,  F, j  is  the  deletion 
time,  and  Ar  is  a  list  of  resource  attributes. 

Rid  designates  both  discrete  media  and  streaming  media  and  may  contain  the  port  where  the  resource  is  transmitted. 
The  Pid  value  allows  for  recursive  subsumption  of  resource  components  within  resources,  and  hence  sharing  or  resource 
components  at  an  arbitrary  granularity.  For  instance,  users  can  share  an  entire  window,  or  a  graphical  object  within  that 
window.  Among  the  relevant  resource  attributes  Ar,  Class  describes  whether  the  resource  is  continuous  or  discrete.  Type 
characterizes  the  media  object  class,  indicating  whether  a  resource  is  text-based,  graphical,  or  some  real-time  medium  and 
identifies  the  purpose  it  serves.  Usage  determines  if  the  resource  can  be  used  concurrently  by  multiple  users  or  requires 
sequential  processing  with  exclusive  floors.  For  instance,  a  shared  whiteboard  allows  for  multiple  concurrent  telepointers 
with  a  small  number  of  users,  whereas  a  remotely  controlled  camera  can  only  perform  a  positioning  command  for  one  user 
at  a  time.  Priority  sets  an  importance  value  on  the  transmission  and  processing  of  the  information,  preempting  other  media 
dissemination  of  lower  ratings.  QoS  defines  the  required  Quality-of-Service  [38]  for  the  resource,  including  the  tolerable  loss, 
the  required  resolution,  the  possible  maximum  delay,  and  the  color  depth.  Other  criteria  may  be  added  depending  on  the  nature 
of  the  resource,  such  as  the  channel  number,  a  frame-rate,  encoding  scheme,  sampling  rate  etc.  The  Protection  attributes 
denotes  whether  a  resource  is  public,  private,  or  proctored,  which  may  be  expressed  with  a  numerical  value,  or  work  in  analogy 
with  the  Bell-LaPadula  model  [2],  discerning  between  top-secret,  secret,  confidential,  or  unclassified  information  [13].  The 
last  component  describes  access  privileges  to  the  collaborative  workspace,  called  “floors”: 

Definition  6.  A  floor  F  £  F  is  a  tuple 


F  =<  Fid,  Rid,  Uid,  T],  Tj,  Ap  > 


(5) 


where  Fid  is  a  unique  floor  identifier  within  the  shared  workspace  for  a  resource  Rid,  assigned  to  user  Uid.  at  inception  time 
Tj,  and  deactivated  at  time  Tj,  with  Ap  denoting  a  list  of  floor  attributes. 

Note  that  one  Rid  may  have  multiple  Fid.  assigned  for  control  of  various  granules,  but  each  floor  is  controlling  exactly 
one  resource.  Floors  are  indirectly  associated  with  sessions  via  Rid,  and  floor  properties  may  be  inherited  from  a  master 
resource  to  its  subcomponents.  We  assume  that  one  floor  F  is  assigned  per  resource  component.  The  pairing  (Fid,  Rid) 
specifies  the  granularity  of  control  and  the  commands  available  with  possession  of  the  floor.  A  floor  can  control  an  entire 
conference,  an  application,  a  single  window,  or  a  shared  object  [18].  For  instance,  for  audio  the  associated  commands  may 
be  talk,  mute,  pause.  Video  floor  commands  are  for  instance  caption,  forward,  cut,  replay.  Floors  can 
be  static  relative  to  a  session  lifetime,  or  dynamic,  i.e.,  assigned  ad  hoc  by  a  computer  or  social  protocol.  The  combination 
of  Uid  and  the  attributes  specifies  whether  the  user  is  FC,  FH,  chair,  or  general  participant.  T,  and  Tj  may  be  set  using 
real-time  clocks,  or  a  logical  session  time.  The  floor  attributes  Ap  comprise  directionality  of  control,  state,  instantiation, 
passing  rules,  connection  modality  and  access  strategy. 

The  presented  model  serves  both  theoretical  and  practical  purposes.  It  provides  a  more  elaborate  framework  for  formal 
specification  and  validation  of  collaborative  systems,  e.g.,  with  the  prototype  verification  system  [29].  It  also  allows  for 
session  capability  descriptions  [26]  to  set  up  and  query  the  membership  and  coordination  status  of  an  active  conference,  where 
a  capability  is  understood  as  a  resources  or  system  feature  influencing  the  selection  of  useful  configurations  for  components. 
We  have  also  developed  an  activity  semantics  describing  causality  and  coordination  constraints  using  the  presented  taxonomy. 
It  is  not  our  intention  to  provide  a  comprehensive  model  and  parameterization  of  coordination  services,  but  rather  discuss  key 
concepts  as  a  stepping  stone  toward  more  sophisticated  design  and  deployment  of  software  using  group  coordination. 

4.  SYSTEM  ARCHITECTURE 

In  contrast  to  the  majority  of  commercial  and  experimental  CME  existing  to  date,  we  look  at  collaboration  as  an  inherently 
distributed  process,  where  session  coordination  and  control  are  enacted  collectively  by  participating  hosts,  rather  than  fixing 
such  roles  in  centralized  servers.  We  postulate  henceforth  a  coordination  architecture  with  the  following  requirements:  sim¬ 
plicity  of  implementation  and  maintenance;  scalability  in  the  number  of  users  and  hosts;  security  with  regard  to  the  exchange 
of  coordination  information  and  data;  extensibility  for  new  resources  and  session  models;  efficiency  in  coordination,  con¬ 
cerning  low  latency  and  protocol  state  overhead;  reliability  with  regard  to  failures  of  hosts,  resources,  and  network  elements; 
persistence  of  coordination  information  at  hosts  despite  the  ephemeral  nature  of  access  permission  exchanged  between  collab¬ 
orating  sites;  and  interoperability  between  heterogeneous  platforms.  A  more  elaborate  view  on  this  architecture  is  presented 
in  [9], 


5.  CONCLUSION 

A  comprehensive  framework  for  group  coordination  in  networked  multimedia  systems  has  been  presented.  The  framework 
has  its  foundation  in  a  formal  model  of  group  coordination  and  collaboration,  related  to  hierarchical  session  control  and  role- 
based  session  participation,  revolving  around  the  notion  of  turn-taking  in  interactive  groupwork.  Important  design  issues  for 
such  an  architecture  have  been  discussed  in  conjunction  with  the  various  media  and  session  types  and  their  properties.  Our  co¬ 
ordination  model  does  not  represent  a  panacea  for  the  many  open  problems  encountered  in  groupware,  CSCW  and  networked 
multimedia  systems.  Rather,  we  intend  it  to  be  an  integrative  step  towards  a  better  understanding  of  group  collaboration,  and 
more  flexible,  rich  middleware  services  to  facilitate  it. 
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